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f| ' A formulation of bit-string models of language evolution, based on differential equations 

^ I for the population speaking each language, is introduced and preliminarily studied. Con- 

nections with replicator dynamics and diffusion processes are pointed out. The stability 
of the dominance state, where most of the population speaks a single language, is an- 
alyzed within a mean-field-like approximation, while the homogeneous state, where the 
population is evenly distributed among languages, can be exactly studied. This analysis 
discloses the existence of a bistability region, where dominance coexists with homogene- 
ity as possible asymptotic states. Numerical resolution of the differential system validates 
these findings. 
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1. Introduction 

The dynamics of language has recently been identified as a rich field for interdis- 
ciplinary application of statistical techniques traditionally associated with niathe- 
5^ \ matics and physics. A host of analytical and numerical models have been proposed, 

aimed at reproducing -more or less quantitative ly- several aspects of language as 
a dynamical sociocultural entity! I I '^ ll '-' l " ! ' l ° l ^ l Among them, models of language 
evolution focus on the joint processes of mutation of linguistic fea tures and of lan- 
guage acquisition, switching, and adoption by human populations! ^ ^^ 

Bit-string models of language evolution, prototyped by Schulze's model,!^ con- 
ceive that a language is completely characterized by a sequence of dichotomic prop- 
erties. Each of them represents, for instance, whether the language in question pos- 
sesses certain grammatical property or not. The sequence is naturally represented 
as a string of binary variables (bits), whose length is the number of yes/no ques- 
tions which fully identify a language. Emphasis in the stu dy of bit-strin g language 
models has been put on agent-based numerical simulations) l^ ^ ^l addressing 
possible explanations for the distribution of population among languages and the 
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abundance of language families. The language spoken by an individual can mu- 
tate at a certain rate, and each individual can also abandon his/her language and 
adopt a different one copied from a randomly chosen member of the population. 
Upon variation of the mutation rate, simulations show a sharp transition between a 
state of dominance, where most of the population speaks the same language, a nd a 
situation where the population is homogeneously distributed among languages! " ^ I 
In this paper, I introduce a formulation of bit-string language models based on 
differential equations for the population fraction speaking each language. The evo- 
lution turns out to be a combination of replicator dynamics, representing language 
switching, and diffusion in the bit-string (hypercubic) space, representing mutation. 
While the stability of the homogeneous state can be exactly analyzed, the study 
of the dominance state requires a mean-field-like approximation. The results, nev- 
ertheless, are in very good agreement with numerical resolution of the differential 
equations and with agent-based simulations. They predict a region of bistability, 
where dominance and homogeneity coexist as possible asymptotic states of the sys- 
tem, and the disappearance of the dominance state through a tangent bifurcation. 
The present formulation provides an alternative to simulations to trace language 
progress and regression in the framework of bit-string models. 

2. Analytical formulation of bit-string language models 

As advanced in the Introduction, in bit-string models an individual language is 
represented by a string of L binary variables (bits), each of them adopting one of 
two possible values, say, and 1. The total number of possible languages is iV = 2-^. 
Evolution is driven by two dynamical rules. First, the language of an individual can 
mutate with a certain probability per time unit. Each mutation event consists of a 
change in a single, randomly selected bit of the individual's language, from to 1 
or vice versa. Second, an individual can give up his/her language and adopt a new 
one. To account for the preference for more widespread languages, the new one is 
chosen by selecting an individual at random from the whole population and adopting 
his/her language. In this way, the probability of switching to a given language (say, 
to language i) equals the fraction of the population speaking i, denoted by Xi. 
Moreover, the probability of abandoning the original language (say, language j) 
is weighted by a monotonically decreasing function u{xj) of the fraction of the 
population speaking j. A language is thus less likely to be given up if it is spoken 
by a large fraction of the total population. Previous numerical analyses of bit-string 
models have considered the c ases u(x ) = 1 — x"^ and u{x) = (1 — a;)^, both of which 
satisfy u(0) = 1 and u(l) = O.EHnilSl 

Bit-string models are added with population dynamics, for instance, asexual 
reproduction with births at a given rate, and deaths with a mortality rate propor- 
tional to the population size. This insures that -after a transient, and up to small 
fluctuations- the total population remains constant in time. A newborn inherits 
the language of the parent, except possibly for mutation or switching to a new Ian- 
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guage, following essentially the same rules as described above. The effect of births 
on language evolution, thus, is effectively the same as mutation and switching by 
"adult" individuals. 

In its original formulation, Schulze's model considers that the features of a given 
("superior") language may be preferred to those of any other language. During mu- 
tation events, with a certain probability, a bit copies the value of the homologous bit 
in the "superior" language, enhancing the possibility that this language is eventu- 
ally spoken by most of the population. Here, I disregard this mechanism and assume 
that all languages are intrinsically equivalent. 

The dynamical rules of bit-string language models can be readily expressed 
as differential equations for the fractions Xi{t) {i = 1,...,N) of the population 
speaking each language, which satisfy the normalization condition 

N 

at all times. For the sake of simplicity, I will assume that the total population 
remains exactly constant in time, incorporating newborn language's mutation and 
switching to the respective overall mechanisms. 

The contribution of mutation to the time variation of Xi consists of a gain term 
proportional to L~^ ^ WijXj, where Wij = 1 if languages i and j differ by just one 
bit, and Wij — otherwise. The sum runs over all the languages. The gain term 
represents the fraction of the population speaking any language j which mutates to 
language i in a time unit. The factor L~^ accounts for the fact that a mutation event 
can lead, with identical probability, from each language j to L different languages. 
Correspondingly, there is a loss term proportional to L^^Xi ^ ■ Wji, which represents 
mutations of language i to other languages. In turn, the gain term associated to 
the mechanism of language switching is proportional to xt ^ ■ Xju{xj), because the 
probability of abandoning j is u{xj), and the probability of adopting z is a;^. The 
corresponding loss term is proportional to Xiu{xi) ^ ■ Xj. 

If pL and p are, respectively, the rates of mutation and switching events, the 
evolution of the fraction Xi (i = 1, 2, . . . , TV) is given by 



u{xi) + T y^ ^^j^o - ^'' y^ '^ji ■ (2) 





Summing these equations over the index i yields J2i ^t — 0, which insures that Eq. 
(dl) holds if the initial fraction satisfy normalization. The functions u{xi) make the 
equations nonlinear, so that little can be expected from trying to solve them exactly. 
It is worthwhile mentioning that a joint dynamical description of mutation and 
population growth -here corresponding to switching to widespread languages- has 
been pro posed for the process of language learning and the evolution of universal 
grammar! I I Several exact results are known for the relevant equations, though 
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they depend crucially on the fact that the functions which play a role analogous to 
that of u{x) are linear on the population fractions. 

Also, it is important to realize that the representation of languages as bit strings 
is essentially irrelevant to the form of the evolution equation ([2]) . As long as each 
language is interpreted as a kind of state to which a certain fraction of the population 
is assigned at each time, the specific way in which such states are individualized 
does not play a role in the equation. At most, it is necessary that those states are 
suitably labeled, in such a way as to discern the languages between which population 
can be transferred due to mutations, i.e. as to fix de coefficients Wij. Otherwise, the 
mathematical form of the equations will be the same irrespectively of the individual 
characterization of languages. Though, for clarity, I discuss Eq. ([2|) with reference 
to bit-string models, it is important to bear in mind that most results will also hold 
for other models driven by the mechanisms of mutation and switching. 

The contributions of mutation and switching to the evolution of Xi have oppo- 
site, competing effects. While mutation spreads individuals over different languages, 
switching tends to concentrate the population on languages with an already large 
number of speakers. It is useful to begin with a separate analysis of the two mech- 
anisms, which are closely related to well-understood processes in other areas of 
science. Then, one can proceed to show that the combination of the two compet- 
ing mechanisms gives rise to a critical transition between language diversity and 
domin ance, as effectively observed to occur in numerical simulations of bit-string 
models.l^U^ 



3. Connection with diffusion and with replicator dynamics 

3.1. Without switching: p = 

In order to isolate from each other the effects of mutation and switching, let us first 
disregard the latter, setting p = in Eq. ([2|). Taking into account that Wij — Wji, 
the dynamics without switching is governed by the equations 

N 
Xi^ J■^W^J{XJ-X^), (3) 

i = 1,2, . . . , N. This expression emphasizes the fact that mutation is a form of linear 
diffusion, with population transfer between languages at a rate proportional to the 
population difference Xj — Xi. The diffusion process takes place on the set of bit 
strings of length L, which can be assimilated to the vertices of the L-dimensional 
hypercube of unitary side. Mutation events transfer population between languages 
which differ in a single bit, i.e. between hypercube vertices at Hamming distance 
d = 1. Each vertex has L neighbour vertices at d = 1. 

Since Wij = 1 if the Hamming distance between i and j equals one, and Wij — 
otherwise, one has '^jWij = L. Therefore, introducing the hypercube adjacency 
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matrix W = {wij}, Eq. ([3]) can be rewritten as 

i = -/ix + y Wx = yX>x, (4) 

with X = (xi, a;2, . . . , xn) and I? = W — LX, where T is the identity matrix. The 
solution to the linear equation ([4]) can be found by standard methods, in terms of 
the eigenvalues A^ and eigenvectors e^. (fc = 1, . . . , N) of the matrix V. 

One of the eigenvalues of V, say Ai , equals zero, and all the remaining eigenvalues 
are negative. Consequently, the eigenvector corresponding to Ai, ei = (1, 1, . . . , 1), 
yields the equilibrium solution which, due to the normalization condition ([1]), is 
X* = N~^ for all i. In other words, starting from any initial condition, the system 
asymptotically reaches a homogeneous state. In the context of the original problem, 
in the absence of language switching, mutation leads for long times to a state where 
the population is homogeneously distributed among all possible languages. 

3.2. Without mutation: fi — 

If, now, /i = so that mutation is absent, Eq. ^ becomes a member of a well- 
studied class of nonlinear equations, generally written as 

N 
Xi = XiJiiX) — Xi y ^ XjJj[X.), (oj 

i = 1,2,..., A^, with x = (xi, a;2, . . . , xn)- Equations ^ define the so-called repli- 
cator dynamics! I I They describe the evolution of a system of interacting species 
in terms of the fractions xi = Ui/n, where rii is the number of individuals of species 
i and n = '^^rii is the total population in the system. The function /i(x) is the 
reproductive rate, or (Fisherian ■ ) fitness, of species i. It is assumed to depend 
on the population fractions of all other species. The replicator e quatio ns are at the 
basis of the mathematical description of evolutionary dynamics! ^^ 

The first term in the right-hand side of Eq. ([2|) is obtained from ([5]) for /i(x) — 
—pu{xi). In bit-string language models, therefore, the "fitness" to be assigned to 
each language is a given function of the corresponding population fraction, the same 
for all languages, and does not depend on the populations of other languages. The 
negative proportionality between u and fi points out that the function u{x) defines 
a kind of "unfitness." In fact, it measures the probability with which a language is 
abandoned for adoption of another one. 

Though the general solution to the replicator equations is not known, some 
generic mathematical properties make it possible to characterize aspects of the 
evolution and of its asymptotic state. These properties can be straightforwardly 
translated to Eq. ^ when /x = 0. First, the ratio of the populations fractions of 
any two languages i and j satisfy the "quotient rule" '^ 
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which depends on Xi and Xj only. It can be formally integrated, yielding 

Suppose now that the initial population of language i is larger than that of j, 
i.e. Xi{0) > Xj(0). Since u{x) is a decreasing function of x, the initial value of 
u{xj) — u{xi) is positive. This implies that the ratio Xi/xj will grow, making the 
difference between the two populations larger. This kind of feedback effect will be 
enhanced as time elapses. If, on the other hand, Xi{0) < Xj{0), the ratio Xi/xj 
will monotonically decrease along the evolution. Since the normalization condition 
^ holds, this result suggests that, generically, the language which initially has the 
largest population fraction, say language i, will asymptotically accumulate the whole 
population, Xi{t) — + 1 as i — > cxd, while the fractions of all the other languages will 
asymptotically vanish. The only exception to this behaviour happens if two or more 
languages have exactly the same initial population. For two of these languages, 
in fact, u{xj) — u{xi) — 0, and Xi and Xj remain identical at all times. If their 
initial populations are larger than that of any other language in the system, they 
will asymptotically share the whole population in equal parts, and the remaining 
populations will vanish. 

The same results arise from a global stability analysis of Eq. ([2]), based on the 
fact that, for /i = 0, it admits a Lyapunov functional,'^^ 

TV 

E 



Its time derivative is 



C/(x) = V / u{x)dx. (8) 







U = y^uix,)x,=p{{u)^~{u^))<0, (9) 



i=l 



with (u) = '^^Xiu{xi) and (u^) — '^■Xiu{xi)'^ . Due to the definiteness of the sign 
of U, Lyapunov's theorem holds and, in particular, the asymptotic state of the 
differential system is in the manifold where U = 0, i.e. where (u)^ = (w^). This 
condition, along with Eq. ([T]), is met if iVb among the N population fractions are 
equal to N^^ (1 < -/Vq < N), while the remaining N — Nq fractions are equal to 
zero. As advanced from the analysis of Eq. ([7]), the asymptotic state consists of a 
number of languages with identical non-vanishing populations, while all the other 
languages are absent. 

Note that the homogeneous stationary state where x* = N~^ for all i, obtained 
in Section 13.11 for the case without switching, is also found among the equilibrium 
solutions of the system without mutation. Here, however, this equilibrium solution 
corresponds to an extremely special initial condition, where all the initial popula- 
tions are identical. As a matter of fact, all the equilibria with Nq > 1, where more 
than one language survives asymptotically, are rather special, since they require 
that two or more populations are initially identical. These equilibria would not be 
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robust under the effects of fluctuations in the initial condition, or of noise during 
the evolution. In the most generic case, on the other hand, there is a single maximal 
population. The above analysis shows that this population will grow to the expense 
of the others. In the absence of mutation, thus, the system generically approaches 
a state where only one language survives. This dominant language accumulates the 
whole population. 

4. Stationary states with mutation and switching 

Under the combined effects of mutation and switching, it is expected that the system 
approaches a state where the population is distributed among languages in a way 
that interpolates between the stationary solutions discussed separately for each 
mechanism in Sections 13. II and [3. 21 With the notation introduced there, Eq. ([2]) can 
be rewritten as 

N 

ii = pxi[{u) - u{xi)] + —^^DijXj. (10) 

Stationary solutions are given by equating the right-hand side to zero for every 
i ~ 1,2, ... ,N, and solving for Xi. It should be clear by now that a particular 
solution is the homogeneous distribution x* = N^^ for all i. Numerical simulations 
of agent-based bit-string language models confirm the presence of this stationary 
state of homogeneity for large mutation rates. For small fi, on the other hand, 
they suggest that there is a stationary solution with a dominant language, which 
accumulates most of the population, together with several less populated languages. 
In terms of the results of Sections 13. II and [3. 2i this solution should be interpreted as 
the consequence of the interplay between the replicator dynamics of switching, which 
concentrates population in a single language, and mutation, which redistributes part 
of the population among "dialects" around the dominant language. 

Generally, a joint Lyapunov functional for replicator dynamics and diffusion does 
not exist, so that a global stability analysis of Eq. (fTO|) is not possible along the 
lines used in Section [3.21 To my knowledge, the only exception is the case of linear 
unfitness, u{x) = I — x, where the functional 



N N N 

P Y^ 2 2 P V^ 3 , M Y^ 

- > Xj X,- — - > Xj + —- y 

z,j — 1 z— 1 i J — 1 



y(x) = ^ y^ ^f^^.Ly^f + ^y D,,x.x,, (11) 



satisfies Xi = dV/dxi, so that V >Q. 

In any case, the local stability of the homogeneous state x* ~ N^^ can be 
analyzed by standard linearization of Eq. (fTO|) . From this analysis it turns out that 
the homogeneous state is a stable equilibrium above the critical mutation rate 

Pi^^\uiN-% (12) 

where u'{x) is the derivative of the unfitness function. Note that, unless u{x) has 
a singularity at a; = 0, the critical mutation rate is very small if the number of 
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languages N is large. This seems to strongly disagree with numerical resultsM^with 
p = 1, L = 8 {N = 256), and u{x) = 1 — x^, agent-based simulations indicate that 
homogeneity is asymptotically approached for fi > 0.14, while the above equation 
predicts /ii ^ lO^''. As I show below, however, this discrepancy is fallacious. 

Performing a linear stability analysis for the equilibrium state where the popu- 
lation is concentrated around a dominant language, requires first to explicitly find 
the corresponding stationary solution x*, which turns out not to be trivial at all. 
The equilibrium equations derived from Eq. ()10p couple nonlinearly, through the 
average (u), all the population fractions x*. An approximate solution can however 
be found in the limit of small mutation rates, /i <C 1, when essentially all the pop- 
ulation speaks the dominant language. Assuming that the equilibrium population 
fraction x*i of a language at Hamming distance d from the dominant one is of order 
/Li'', and keeping only the most significant terms in powers of /i, yields 



^^'1^ 



(13) 



The population fraction of the dominant language, Xq, can be obtained from the 
normalization condition and, within the same approximation order, is Xg = 1 — 
/i/p. Note that this solution for small /i is independent of the form of u(x). This 
is in agreement with numerical simulations for very small mutation rates, which 
have found no sensible dependence on the unfitness function. Note also that the 
relevant small parameter in this approximation is the ratio /i/p. Linear stability 
analysis of the solution (|13p shows that, as expected, it is stable for small ^. An 
instability is predicted for ^2 = P, but this critical value of the mutation rate is 
outside the validity range of the approximation. Comparing with Eq. (|12|) . however, 
this result suggests that there may be an interval in /x, just above pi, where both the 
homogeneous state and the dominance state are stable. Such behaviour would be 
consistent with known features in models of language learning with mutations.'^'^ 
In the following section, I support this conjecture from a different approach, and 
validate the result by means of numerical resolution of Eq. (jlOp . 



5. Bistability of dominance and homogeneity 

In order to progress beyond the limit of small mutation rates, the solution to Eq. 
(fTO|) must be approached from a different perspective. I thus focus the attention 
on the evolution of the maximal population fraction all over the system, Xmax, 
which I assume is reached at a single language. Furthermore, invoking a kind of 
mean-field approximation, I suppose that the population fractions of the remaining 
N — 1 languages are mutually identical. Due to normalization, their value is (1 — 
Xma.x)/{N — 1). Replacing in Eq. (jlOp yields an evolution equation for .Tmax^ 



pxi, 



^[L ^maxj 



1 



N- 1 



u{xn 



Nt ~ 1 
N ~1 



(14) 
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Encouragingly, x^-^^^ — N~^ is an equilibrium solution of Eq. ()14|) . for any form 
of u{x). For this solution, the population fractions of all the other languages are also 
N^^, so that Eq. (fT4|) correctly predicts the existence of a stationary homogeneous 
state. On the other hand, its stability threshold is predicted by linearization of Eq. 
(fTi]) at fli = p{N - l)\u'{N-^)\/N'^ which, for large N, differs from the critical 
mutation rate given in Eq. (fT2|) by a factor of two. This is however understandable, 
taking into account that not any deviation from homogeneity is compatible with 
the present approximation, so that not all eigenvalues of the original problem are at 
work in Eq. (1141) to bring the system to equilibrium. Nevertheless, the dependence 
of fli and fii on N is the same and, in particular, fli tends to zero for large N. 

Note also that, for fi — 0, x^^^ = 1 is always an equilibrium solution. It turns 
out to be stable for any decreasing function u{x). This solution corresponds to the 
dominance state in the absence of mutation, where all the population speaks the 
same language. 

The existence and stability of other equilibrium solutions for Eq. (J14p depend on 
the form of the unfitness u{x). For the sake of concreteness, I discuss the problem 
in the limit of large N, where the equation for the equilibria of Eq. (fH]) takes the 
simpler form 

Here, I have assumed that u(0) = 1. The homogeneous state is now given by the 
trivial solution a^Jj^^x ~ ^- ^^ '^^^ be easily realized that, if u{x) decreases with x 
and fi is positive but not too large, two additional solutions exist in the interval 
(0, 1). The larger solution is a stable equilibrium of Eq. (fT4| . and tends to xj^j^^ = 
1 for fi —f 0, thus corresponding to the dominance state. The lower solution is 
unstable, and approaches zero in the same limit. As the mutation rate grows, these 
two solutions approach each other, and eventually collide and disappear through a 
tangent bifurcation at a critical mutation rate fi2- 

The scenario is illustrated in Fig. [l]for three forms of the unfitness u{x). The 
curves show x^^x ^^ ^ function of the (normalized) mutation rate /i/p, corresponding 
to the dominance state (full line) and the unstable solution (dotted line) . For these 
low-degree polynomial forms of u{x) the critical point of the tangent bifurcation can 
be exactly calculated. For u{x) — (1 — x)^, l — x, and (1 — x)^, the respective critical 
mutation rates are /i2 — 4p/27 ~ 0.148p, /9/4, and 2p/3\/3 ~ 0.385/3. The first 
value is in very good agreement with the critical point reported from agent-based 
numerical simulations (/i = 0.14 for p = 1 and N = 256), and makes it possible 
to identify the transition detected numerically as the tangent bifurcation at H2- 

As advanced at the end of SectionJH the present results predict that, for mutation 
rates satisfying pi < p < p2, both dominance and homogeneity are stable states for 
the distribution of population among languages. In other words, two stable solutions 
coexist and can be asymptotically approached during the evolution. The asymptotic 
state is selected by the initial condition: in Eq. (fT4| . the attraction basins of the 
two solutions are separated by the intermediate unstable state. This dependence on 
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Fig. 1. The two solutions of Eq. I I15I I in the interval (0,1) as functions of the normalized mutation 
rate ^l/ p, for three forms of the unfitness function u{x). Full lines stand for the stable equilibria 
of Eq. I I14II . in the limit N —* oo, corresponding to the dominance state. Dotted lines indicate 
unstable equilibria. 



the initial condition has also been noticed in agent-based simulations ! I I 

In spite of the agreement with simulations, Eq. (|14[) remains the outcome of a 
rather rough assumption on the population distribution over languages. It is there- 
fore worthwhile to compare its predictions with results from the numerical resolution 
of Eq. (jlOp. This is also an opportunity to consider relatively small values of N, 
which have been disregarded in Eq. (fT5| . Curves in Fig. [2] show the equilibria of Eq. 
([T4| as functions of the mutation rate /i, for L = 5 {N — 32), p = 1, and two forms 
of the unfitness u{x). Dots stand for the maximal population fraction obtained, at 
long times, from the numerical resolution of Eq. (jlOp with each function u(x). Two 
kinds of initial conditions have been considered. In one of them, the initial popu- 
lation fractions are chosen at random, all of them close to the homogeneous state, 
and satisfying the normalization ([1]). In the other initial condition, one of the pop- 
ulation fractions equals unity, and all the other are zero. As expected, the former 
are found to asymptotically approach the value x'^^^^ ~ N^^ w 0.031. When the 
initial population is concentrated in a single language, on the other hand, x'^^^ is 
relatively large for mutation rates below the tangent bifurcation, and drops to N~^ 
above it. The agreement between the results from Eqs. (fTO|) and (fT4|) is excellent 
for small values of fi and, perhaps not unexpectedly, worsens as the critical point 
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Fig. 2. Pull and dotted curves respectively stand for the stable and unstable equilibrium solutions 
to Eq. II14I I. as functions of the mutation rate, for p = 1 and A^ = 32 (L = 5) and two forms of the 
unfitness u(x). Dots correspond to long-time measurements of Xmax from numerical resolution of 
Eq. l llOI I for each form of u(x). 

fj,2 is approached. However, the overall pictures are qualitatively identical. 



6. Conclusion 

In this paper, I have presented a formulation of bit-string models of language evolu- 
tion based on differential equations for the fraction of the population speaking each 
language. The formulation highlights the fact that these models conceive language 
evolution as combining a diffusion mechanism, given by mutation between simi- 
lar languages, with replicator dynamics for language switching, when an individual 
adopts the language of a randomly selected member of the population. The combi- 
nation of replicator dynamics and diffusion is not new in the literature of biological 
evolution models. In contrast with bit-string language models, however, these ap- 
plications to biological evolution often assume that the fitness of individual s of ea ch 
species is independent of the population, though it varies between species! ^ ^ In 
bit-string models, on the other hand, the dependence of the unfitness on the popu- 
lation fraction is essential to represent preference for more widespread languages. 

The decrease of the unfitness of a language when its population grows is the 
key ingredient which shapes the behaviour of bit-string models. In particular, the 
occurrence of a tangent bifurcation where the dominance state -with most of the 
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population speaking the same language- disappears, and the existence of a param- 
eter region where dominance and homogeneity coexist and are stable, are direct 
consequences of such dependence. This bistability opens the possibility that, in a 
system of languages divided into weakly-interacting domains, where inter-domain 
switching is much less likely than intra-domain transitions, some domains converge 
to the dominance state while others approach homogeneity, even when the evolution 
parameters are identical all over the system. 

It is worth remarking that a dominance-homogeneity transition is known to 
happen in genetic space within r eplicat ion- mutation models of molecular evolu- 
tion, specifically, in Eigen's model! ^ ^ Though the involved critical phenomenon 
is mathematically different from the tangent bifurcation disclosed here, they are 
qualitatively much the same from the viewpoint of the competing balance between 
the basic mechanisms at work. 

It would be interesting, by inspiration from mathematical studies of biologi- 
cal evolution, to add bit-string models with some non-uniformity in the individual 
properties of languages, for instance, assigning different parameters to the unfitness 
function of each language. In fact, it may well be that the preference for a very 
widespread language is inhibited by intrinsic difficulties to acquire it. Vice versa, 
a language whose acquisition is perceivably easier could be chosen as a common 
communication means by populations speaking more widespread but more intricate 
languages. Do you imagine the whole of mankind eventually deciding to speak Man- 
darin and its dialects'?^ Think also of Mark Twain's "The horrors of the German 
language.' "^ 

Let me finally point out that comparison of numerical results from bit-string 
models and empirical data of language statistics -specifically, the distribution of 
language sizes, and the abundance within language families- has been focused on 
transient stages of the evolutionary process. In agreement with this, though not 
analyzed in detail here, the present study shows that asymptotic states in bit-string 
models are not a good representation of empirical observations. Transient effects in 
the present formulation could be addressed by numerical resolution of Eqs. ([2]), but 
such analysis would deserve a separate presentation. 
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